Deep Reinforcement Learning for Decentralized Multi-Robot Exploration With Macro Actions
نویسندگان
چکیده
Cooperative multi-robot teams need to be able explore cluttered and unstructured environments while dealing with communication dropouts that prevent them from exchanging local information maintain team coordination. Therefore, robots consider high-level teammate intentions during action selection. In this letter, we present the first Macro Action Decentralized Exploration Network (MADE-Net) using multi-agent deep reinforcement learning (DRL) address challenges of exploration in unseen, unstructured, environments. Simulated robot experiments were conducted compared against classical DRL methods where MADE-Net outperformed all benchmark terms computation time, total travel distance, number interactions between robots, rate across various degrees dropouts. A scalability study 3D showed a decrease time increasing environment sizes. The presented highlight effectiveness robustness our method.
منابع مشابه
Deep Reinforcement Learning With Macro-Actions
Deep reinforcement learning has been shown to be a powerful framework for learning policies from complex high-dimensional sensory inputs to actions in complex tasks, such as the Atari domain. In this paper, we explore output representation modeling in the form of temporal abstraction to improve convergence and reliability of deep reinforcement learning approaches. We concentrate on macro-action...
متن کاملExploration for Multi-task Reinforcement Learning with Deep Generative Models
Exploration in multi-task reinforcement learning is critical in training agents to deduce the underlying MDP. Many of the existing exploration frameworks such as E, Rmax, Thompson sampling assume a single stationary MDP and are not suitable for system identification in the multi-task setting. We present a novel method to facilitate exploration in multi-task reinforcement learning using deep gen...
متن کاملTowards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning
Developing a safe and efficient collision avoidance policy for multiple robots is challenging in the decentralized scenarios where each robot generate its paths without observing other robots’ states and intents. While other distributed multirobot collision avoidance systems exist, they often require extracting agent-level features to plan a local collision-free action, which can be computation...
متن کاملPlanning with macro-actions in decentralized POMDPs
Decentralized partially observable Markov decision processes (Dec-POMDPs) are general models for decentralized decision making under uncertainty. However, they typically model a problem at a low level of granularity, where each agent’s actions are primitive operations lasting exactly one time step. We address the case where each agent has macroactions: temporally extended actions which may requ...
متن کاملEX2: Exploration with Exemplar Models for Deep Reinforcement Learning
Deep reinforcement learning algorithms have been shown to learn complex tasks using highly general policy classes. However, sparse reward problems remain a significant challenge. Exploration methods based on novelty detection have been particularly successful in such settings but typically require generative or predictive models of the observations, which can be difficult to train when the obse...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE robotics and automation letters
سال: 2023
ISSN: ['2377-3766']
DOI: https://doi.org/10.1109/lra.2022.3224667